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ABSTRACT. The purpose of this note is to provide an existence result for the solution of fully coupled 
Forward Backward Stochastic Differential Equations (FBSDEs) of the mean field type. These equations 
occur in the study of mean field games and the optimal control of dynamics of the McKean Vlasov type. 



1. Introduction 

Stochastic differential equations of the McKean - Vlasov type are Ito's stochastic differential equa- 
tions where the coefficients depend upon the marginal distribution of the solution. In their partial 
differential form, they were introduced by Mark Kac's in his analysis of the Boltzmann equation for 
the density of particules in kinetic theory of dilute monatomic gases, and a toy model for the Vlasov 
kinetic equation of plasma (see ifTTlfL?! '). 

The purpose of this note is to provide an existence result for the solution of Forward Backward 
Stochastic Differential Equations (FBSDEs) of the McKean- Vlasov type. Following the wave of 
interest created by the pathbreaking work of Lasry and Lions on mean field games |[T3l [T4l [T31 . 
simple forms of Backward Stochastic Differential Equations (BSDEs) of McKean Vlasov type have 
been introduced and called of mean field type. Fully coupled FBSDEs are typically more involved and 
more difficult to solve than BSDEs. FBSDEs of mean field type occur naturally in the probabilistic 
analysis of mean field games and the optimal control of dynamics of the McKean Vlasov type as 
considered in |j4l [3j. See also H] [5J [T7]] for the particular case of Linear Quadratic (LQ) models. 
Detailed explanations on how these FBSDEs occur in these contexts and the particular models which 
were solved are given in Section [3]below. 

The existence proofs given in H and |[3 ] depend heavily on the fact that the problems at hand are 
in fact stochastic control problems and FBSDEs are derived from an application of a version of the 
stochastic maximum principle, and the compactness estimates are derived from the linear nature of the 
forward dynamics and strong convexity properties of the cost functions of the stochastic optimization 
problems. The purpose of this note is to provide a general existence result which does not depend 
upon strong linearity and convexity assumptions. Such an existence result is proven in Section[2] The 
proof relies on Schauder's fixed point theorem used in appropriate spaces of functions an measures. 
A short Section [3] concludes with a short discussion of applications to mean field games and control 
of McKean- Vlasov dynamics studied in (4| and Q. 
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2. Solvability of Forward-Backward Systems of McKean-Vlasov Type 

2.1. First Notation. All the processes considered in this note are defined on a probability space 
(Q, F, F) on which an m-dimensional Wiener process W_ = (Wt)o<t<r is defined. For each random 
variable/vector or stochastic process X, we denote by F x the law (alternatively called the distribution) 
of X. We shall denote by F = {F t )o<t<T the filtration of W and by H 2 ' n the Hilbert space 

rT 

rr2,n 



Z G M°' n ; E J \Z s \ 2 ds < oo} 



where H°' n stands for the collection of all M ra -valued progressively measurable processes on [0, T\. 
For any measurable space (E, £), we denote by V(E) the space of probability measures on (E, £) 
assuming that the cr-field £ is understood. When E is a normed space (most often E = R d in what 
follows), we denote by V P (E) the subspace of V(E) of the probability measures of order p, namely 
those elements of V(E) which integrate the p-th power of the distance to a fixed point (whose choice 
is irrelevant in the definition of V P (E)). For each p > 1, if \i and // are probability measures of order 
p, W p (/j,, p!) denotes the p-Wasserstein's distance defined as 



W p (n,n') = inf 



y\ p E ir(dx, dy) 



7r G V P {E x E) with marginals n and p! 



Notice that if X and X' are random variables of order 2 with values in E, then by definition we have 

W 2 (F X ,F X ,) < [E\X -X'H] 1 / 2 . 



2.2. Assumptions and Statement of the Main Existence Result. Our goal is to solve fully coupled 
McKean-Vlasov forward-backward systems of the general form: 

dX t = B(t,X t ,Y t ,Z t ,F {XuYt) )dt + Z(t,X t ,Y t ,F {XuYt) )dW t 
1 ' ' dY t = -F(t, X t , Y t , Z t , F (XuYt) )dt + Z t dW t , 0<t<T, 

with initial condition Xo = xq for a given deterministic point xq € and terminal condition 
Yt = G(Xt,F Xt ). Here, the unknown processes (X, Y, Z) are of dimensions d, p and p x m 
respectively, the coefficients B and F map [0, T) x R d x W J x R pxm x V 2 {FL d x W) into R d and R p 
respectively, while the coefficient E maps [0, T] x R d x R p x V 2 {R d x R p ) into R dxm and the function 
G giving the terminal condition maps R d x V2(R d ) into R p , all these functions being assumed to be 
Borel-measurable. Recall that the spaces V2(R d x R p ) and V2(R d ) are assumed to be endowed with 
the topology of the 2-Wasserstein distance W 2 defined earlier. 

We now state the standing assumptions of the paper. We assume that, for any t G [0,T], the 
coefficients B(t, •), F(t, •), T,(t, ■) and G satisfy: 

(Al). The coefficients B(t, •), F(t, •), £(t, •) and G are Lipschitz continuous, the Lipschitz prop- 
erty with respect to the measure arguments holding for the 2-Wasserstein distance. Precisely, we 
assume that there exists a constant L > 1, such that for any t G [0, T], x,x' G R d , y,y' G R p , 
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z, z' G R pxm and p, // G P 2 (R d x RP), 

x', y', //') - B(t, x, y, z, p)\ + \F(t, x , y', z, p!) - F(t, x, y, z, p)\ 
+ |S(t, x', y\ n') ~ S(t, x, y, m)| + //) - G(x, 

<L[|(x,y,z)-(x / ,y / ,z / )|+W 2 ( / u,^)]- 

(A2) The functions S and G are bounded, the common bound being also denoted by L. Moreover, 

for any t G [0, T], x G R d , y G W, z G R pxm and /x G P 2 (M dxp ), 



|B(t,a;,y,2,/i)| < L 
\F(t,x,y,z,p)\ < L 



l + \x\ + \y\ + \z\ + ( [ \(x',y')\ 2 dfx(x',y')) 

\JR d xRP J 

1+M+ ( [ \y'\ 2 dp(x',y')\ ' 

\JR d xRP / 



1/2-, 



(A3) The function £ is uniformly elliptic in the sense that for any t G [0, T], x G R d , y G R p and 
/i G P 2 (R d x RP) the following inequality holds 

E(t,x,y,ii)E(t,x,y,(i)^ > L _1 id 

in the sense of symmetric matrices, where Id is the d-dimensional identity matrix. Here and through- 
out the paper, we use the exponent t to denote the transpose of a matrix. Moreover, the function 
[0, T] 3 t <— > £(i, 0, 0, (5(0,0)) i s a l so assumed to be continuous. 
We can now state the main result of the paper. 

Theorem 1. Under (Al-3), the FBSDE ([TJ has a solution. 

The strategy of the proof was sketched in a simpler setting in [5]. We review this strategy before 
giving the details. Because of the Markovian nature of the set-up, we expect Y t and X t to be connected 
by a deterministic relationship of the form Y t = ip(t,X t ), <p being a function from [0, T] x M. d into 
W usually called the FBSDE value function. If this is the case, the law of the pair (Xt, Y t ) is entirely 
determined by the law of X t since the distribution P(x t ,Yt) °^ C^*' * s ec l ua l to C^d; '))(^x t )- 
For a random variable X with values in R d and for a measurable mapping ip from R d into R p , we 
shall denote by ip o Px the image of the distribution Fx of X under the map (Id,ip) '■ K d 9 x 
(x, ip(x)) G R d x R p . With this notation in hand, it is natural to look for a function ip : [0, T] x M. d — > 
W such that 

dX t = B(t, X t ,Y t , Z t , <p(t, •) o F Xt )dt + E(t, X t , y t , •) o P Xt )<^ 

under the constraint that Y t = tp(t, X t ) for t G [0, T]. Translating the above into a nonlinear PDE, 
cp appears as the solution to a nonlinear PDE of the McKean-Vlasov type. The strategy we use 
below consists in recasting the stochastic system ([2]) into a well-posed fixed point problem over the 
arguments (ip, (Px t )o<t<T)- The first step is to use ip(t, •) o Fx t as an input and then to solve (|2]) as 
a standard FBSDE. In order to do so, we use known existence results for standard FBSDEs. 

Remark 1. The proof of Theorem^jgiven in this note can be used to derive existence of a solution of 
([T]) when the law P(x t ,y t ) i s replaced by P(x t ,y t ,z t )- Indeed, Z t is also given by a function of X t in the 
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same way Yt is, since Zt = v(t, Xt) with v(t, x) = d x u(t, x)E(i, x, u(t, x), u(t, •) o fx t ) whenever 
Yt = u(t,Xt) (we prove below that d x u makes sense). We refrain from giving the details to keep 
the technicalities to a minimum, and because we do not know of a practical application of such a 
generalization. 

2.3. Preliminary. Our fixed point argument relies on the following lemma which puts together the 
existence and uniqueness result contained in Theorem 2.6 of Delarue [6 ] and the control of the FBSDE 
value function provided by Corollary 2.8 of |0: 

Lemma 1. On the top of (Al-3), let us also assume that B and F are bounded by L. Then, given 
a probability measure v' £ ), a deterministic continuous function u : [0, T] 3 t <^-t Ut € 

V2(^ d x W), and an initial condition (t,x) G [0, T] x M. d , the forward-backward system 

dX s = B(s,X s ,Y s ,Z s ,v s )ds + H(8,X a ,Y„v a )dW B 

<J) dY s = -F(s,X s ,Y s ,Z s ,v s )ds + Z s dW s , t<s<T, 

with Xt = x as initial condition and Yt = G(Xt, v') as terminal condition, has a unique solution, 
denoted by (X s ' x ,Y S ,X , Z s ' x )t< s <T- Moreover, the FBSDE value function u : [0,T] x W 1 3 (t,x) 
yt,x ^ jg>p - s ]j 0un( i e( i jyy a cons tant 7 depending only upon T and L, and is 1 /2-Holder continuous 
in time and Lipschitz continuous in space in the sense that: 

\u(t, x) - u(t', x')\ < F(\t - t'\ l l 2 + \x- x'\) , 

for some constant T only depending upon T and L. In particular, both 7 and T are independent ofv' 
and v. Finally, it holds Yg' x = u(s, Xt' x )for any t < s < T. 

The boundedness assumption on B and F is stronger than what is necessary for the result of 
Lemma [I] to hold. For instance, the result of only requires that the bound (A2) holds without \x\ 
in the right hand side. We will not need this extra level of generality. We use this existence result 
in the following way. We start with a bounded continuous function ip from [0, T] x M. d into MP and 
a probability measure p, on C([0, T], M. d ) which we want to think of as the law Fx of the solution, 
we denote by p t its marginal distributions on M. d , and we apply the above existence result for ([3]) to 
v' = ut and v t = (p(t, •) o pt for t € [0, T] and solve: 

dX t = B(t, X t , Y U Z U <p{t, •) o fiAdt + E(t, X t , Y t , ip(t, •) o m)dW t 

(4) 

dY t = -F(t, X t , Y t , Z t , ip(t, •) o p t )dt + Z t dWt, < t < T, 

with the terminal condition Yp = g{XT, Pt) and a prescribed initial condition Xq = xq G M d . The 
following estimate will be instrumental in the proof of the main result. 

Lemma 2. On the top of (Al-3), let us also assume that B and F are bounded by L. Then, there 
exists a positive constant T, depending on T and L only, such that, for any inputs (ip, p) and (<£>' ', //) 
as above, the processes (X, Y, Z) and (X' , Y' , Z') obtained by solving (|4]) with (<p, p) and (<p' , //) 
respectively, satisfy 

E[ sup \X t - X' t \ 2 ] +E[ sup \Y t - y/| 2 ] + E / \Z t - Z' t \ 2 dt 

0<t<T 0<t<T Jo 

/ T 

<t(w 2 (pt,^t) 2 + f W2(ip{t,-)op 4 ,<p'{t,-)op' t ) 2 dt 
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Proof. For small time T > 0, this estimate follows immediately from the main estimate Theorem 1.3 
p. 218 of O and the Lipschitz assumption (Al). We only need to show that one can extend it to 
arbitrarily large values of T. Notice that Lemma[T]gives the existence of the FBSDE values functions 
u and u' such that Y t = u(t, X t ) and Y( = u(t, X' t ) for all t € [0, T]. 

As in Corollary 2.8 of fH, we choose a regular subdivision = To < T\ < ■ ■ ■ < T/v-i < Tjy = 
T so that the common length of the intervals [Tj, Tj + i] is small enough in order to apply the main 
estimate Theorem 1.3 p. 218 of (6). For any i € {0, • • • , N — 1} we have: 

E[ sup \X t - X' t \ 2 ]+E[ sup \Y t -Y{\ 2 ]+E [ \Z t - Z' t \ 2 dt 

Ti<t<T i+1 T l <t<T i+1 JTi 

(6) < v(n\X Ti -X' T f] +E[\u(T i+1 ,X Ti+1 ) -u'(T l+1 ,X Ti+1 )\ 2 } 

+ J t ' +1 W 2 (<p(t, •) o ih, ip'(t, •) o fi'tfdt^j . 

We first consider the last interval [T/v-i, TW] corresponding to the case i = N — 1. Since T/v = T 
we have u(T, •) = G(-, lit) an d u'(T, •) = G(-,[i' T ) so that using the Lipschitz property of G we get: 

e[ sup |x 4 -x;i 2 ]+e[ sup \Y t -y/| 2 ] +e / i^-z;! 2 ^ 

T N -!<t<T T JV _ 1 <t<T " -/TV-i 

< r^EOXr^ - X^_J 2 ] + W 2 (fx T , Vt? + £ W 2 {<p(t, •) o ih, v'(t, •) o Lttfdt^j , 

this estimate being true for all ip, tp', fi, fi' and all possible initial conditions for the processes X_ 
and X/. Note that we can assume T > 1 without any loss of generality, and allow the value of T to 
change from line to line as long as this new value depends only upon T and L. Since the FBSDE 
value function u (resp. v!) depends only upon ip and ii (resp. ip' and //), we can choose to keep ip, 
tp 1 , \x, ix 1 , and set Xt n _ 1 = XL = x for an arbitrary x £ Then the above inequality implies 

sup \u{T N ^,x)-u'{T N ^,x)\ 2 <t(w 2 {lit,Ix' t ) 2 + [ W 2 (<p(t,-)o Li t ,<p'(t,-)o n' t ) 2 dt). 

We can now plug this estimate into inequality ([6]) with % = N — 2 to get: 

/•Tisr-i 

E[ sup \X t - X' t \ 2 ] + E[ sup |y t - y/| 2 ] + E / \Zt- Z' t \ 2 dt 

Tjv-2<*<Tjv-i T N _2<t<T N _± JT N _2 

<t(e[\X Tn _ 2 -XL n J 2 ] + W 2 (ix t ,ii' t ) 2 + f W 2 (p(t,-)oix t ,v'(t r )oix' t ) 2 dt 

As before, we can write what this estimate gives if we keep <p, tp', fx, /x', and set Xt n _ 2 = XL = x 
for an arbitrary x S M. d . 

sup \u{T N _ 2 ,x)-u'{T N „ 2 ,x)\ 2 <y(w 2 {ixt,ix' t ) 2 + [ W 2 (<p(t, •) o fx t , y'(t, •) o (x' t ) 2 dt). 

x£R d \ JT N ^2 J 
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Plugging this estimate into inequality (|6]> with i = N — 3 we get: 

/■Tjv-2 

E[ sup \X t - X' t \ 2 ] + E[ sup \Y t - y/| 2 ] + E / \Z t - Z' t \ 2 dt 

Tn-3 <i<TjV-2 7jV-3<t<7jV-2 JT N _ 3 

<rfE[|x Tjv _ 3 -x^ 3 |2 ] + W , 2(/UT; ^ )2+ /" T ^(^.j^^'^.jo^ 

Iterating and summing up these estimates we get (as before the value of the constants can change 
from line to line) 

E[ sup \X t -X' t \ 2 ] +E[ sup |Yt-y/| 2 ] +E / |Z t -Z;| 2 cft 

0<t<T 0<t<T Jo 

<r^(E[|X Ti -^| 2 ] + W 2 (/i T , M ' T ) 2 + / w 2 (^(v)<>/W(V)^/4) 2 *), 

i=0 ^ jTi ' 

from which we get the desired estimate ([5]) after noticing that for each i > 1, we have: 

E[|X Ti - X^| 2 ] < E[ sup \X t - X[\ 2 ) 

Ti—\<t<Ti 

< r ^E[| - X^_f] + W 2 (/i T , ^) 2 + £ W 2 (<p{t, •) o xx t , <p'(t, ■) o ^fdt^j . 
from which we easily conclude. □ 
We shall estimate the integral in the right hand side of (|5]) from the remark: 



w 2 { lH ^' t ) + w 2 ^{t,-)^ t )M^-M)) 

1/2- 

+ ( / Ifa-yOfoaOl^x) 



(V) 



2.4. Fixed Point Argument in the Bounded Case. In this subsection we still assume that the 
coefficients B and F are bounded by the constant L. For any bounded continuous function tp : 
[0,T] x R d -> MP and for any probability measure /x G P 2 (C([0, T]; if we denote by fj, t the 
time t marginal of /x, the map [0, T] 9tH- <^(i, •) o /x t G 7*2 x is continuous. So by Lemma 
[TJ there exists a unique triplet (X t , Y t , Z t )o<t<T satisfying Moreover, there exists a bounded and 
continuous mapping u from [0, T] x M. d into MP such that Yj = u(i, Xt). This maps the input (</>, /x) 
into the output (tt, Px) an d our goal is to find a fixed point for this map. We shall take advantage of 
the a-priori L°° bound on u to restrict the choice of the functions ip to the set: 

(8) Fx = {tp G C([0,T] x R d ;R p ); V(i, x) G [0, T] x M. d , \(p(t,x)\ < j}. 

Similarly, since the drift B and the volatility £ are uniformly bounded, the fourth moment of the 
supremum sup 0<i<T \X t \ is bounded by a constant depending only upon the bounds of B and S. 
Consequently, we shall choose the input measure /x in the set: 

(9) E 2 = {fieV 4 {C([0,T};R d )); [ sup K| 4 d/x(«;) < 7') 

I JC([0,T];R d ) 0<t<T J 
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for 7' appropriately chosen. We then denote by E the Cartesian product E = E\ X E2. We view E 
as a subset of the product vector space V = V\ x V2, where Vi = C&([0, T] x M d ; M p ) stands for the 
space of bounded continuous functions from [0, T) x R d into W, and V 2 = M b (C([0, T];M d )) for the 
space of finite signed measures on the space C([0, T]; M ) endowed with the Borel cr-field generated 
by the topology of uniform convergence. On V\ , we use the exponentially weighted supremum-norm 

\\h\\i = sup e - ^'!^^,^)!, 

(t,x)e[0,T]xR d 

and on V%, the Kantorovitch- Rubinstein norm 

||M|| 2 = sup(/ Fdfi; F£ Li Pl (C([0,T];IR d )), sup \F(h)\ < lj. 

lJC([0,T];R d ) heC([0,T];R d ) J 

Here, Li Pl (C([0, T];R d )) stands for the Lip-1 functions on C([0, T]; R d ) equipped with the metric of 
uniform convergence on compact subsets. 

We emphasize that E\ is a convex closed bounded subset of V\. Moreover, we notice that the 
convergence for the norm || • ||i of a sequence of functions in E\ is equivalent to the uniform conver- 
gence on compact subsets of [0, T] x Mr. Similarly, E2 is a convex closed bounded subset of V2 as 
the convergence of non-negative measures on a metric space for the Kantorovitch-Rubinstein norm 
implies weak convergence of measures. We now claim: 

Lemma 3. Assume that, in addition to (Al—3), the coefficients B and F are also bounded by L. 
Then, the mapping : E B ((/?, /x) (u, Fx) € E defined above is continuous and has a relatively 
compact range. 

Proof. We first check the continuity of <£. Given a sequence (tp n ,pr) in E converging towards 
((/?, jji) £ E with respect to the product norm on V\ x V2, and given the corresponding solutions 
(X n , Y n , Z n ) and (X, Y, Z) obtained by solving (|4]) with ((£> n , pr) and ((p, n) respectively, we have 
(compare with Q): (i) for any t G [0, T], W2{pt, Ht) -> as n -)• +00 since (/x™) n >i converges 
weakly towards \i t and the moments of order 4 of the measures (/i") n >i are uniformly bounded by 
7'; by boundedness of the moments of order 4 again, the integral with respect to t of W/2(^,Ai™) 
converges towards 0; (ii) by continuity and boundedness of 93, and by a similar argument, the integral 
with respect to t of W2(<p(t, -){pt), <f(t, •)(/ u ™)) 2 converges toward as n — > +00; (iii) since the 
sup-norms of all the ip n are not greater than 7, the tightness of the measures {p n ) n >\ together with 
the uniform convergence of (i^ n ) n >i towards ip on compact sets can be used to prove that 

lim sup / \(ip-ip n )(t,y)\ 2 dtf(y)=0- 
Similarly, V^C/^T, Mt) — ^ as n tends to +00. From ([6]> and ([5]), we obtain 

' 0, 



lim 

n— >+oo 



T 

E sup \X t - X?\ 2 + E sup \Y t - Y t n \ 2 + E / \Z t - Z?\ 2 dt 

0<t<T 0<t<T 



from which we deduce that Pjn converges towards Fx as n tend to +00 for the topology of weak 
convergence of measures and thus for || • [I2. Denoting by u n the FBSDE value function which is a 
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function from [0, T] x R d into W such that Y t n = u n (t, X?), and by u the FBSDE value function for 
which Yt = u(t, Xt), we deduce that 

lim sup E\\u n (t,X?) -u{t,X t )\ 2 ] = 0. 

n ^+°°0<t<T 

By Lemma [TJ we know that all the mappings (u n ) n >i are Lipschitz continuous with respect to x, 
uniformly with respect to n. Therefore 

lim sup E\\u n (t,X t ) -u(t,X t )\ 2 } = 0. 

n ^+°°o<t<T 

Moreover, by Arzela-Ascoli's theorem, the sequence (u n ) n >i is relatively compact for the uniform 
convergence on compact sets, so denoting by u the limit of a subsequence converging for the norm 
|| • ||i, we deduce that, for any t G [0, T], u(t, •) = u(t, •) Px t " a - S -- By Girsanov Theorem, Fx t is 
equivalent to Lebesgue measure for any t G (0, T], so that u(t, •) = u(t, •) for any t G (0, T]. By 
continuity of u and u on the whole [0, T] x ~R d , equality holds at t = as well. This shows that 
(n ra ) n >i converges towards u for || • ||i and completes the proof of the continuity of 3>. 

We now prove that <&(E) is relatively compact for the product norm of V\ x V-x- Given (u, v) = 
fi) for some (ip, y) G -E, we know from Lemma[T]that u is bounded by 7 and (1/2, l)-Holder 
continuous with respect to (t, x), the Holder constant being bounded by T. In particular, u remains 
in a compact subset of C([0, T] x M d ; M m ) for the topology of uniform convergence on compact sets 
as (</>, n) varies over E 1 . Similarly, v remains in a compact set when (ip, fj,) varies over E. Indeed, if 
Fx = v is associated to (y>, /x), the modulus of continuity of X is controlled by the fact that B and S 
are bounded by constants independent of </? and //. □ 

We have completed all the steps needed to get a quick proof of the main result of this subsection. 

Proposition 1. Assume that, in additition to (Al—3), the coefficients B, E, F and G are bounded by 
L. Then equation ([T]) has a solution. 

Proof. By Schauder's fixed point theorem, has a fixed point (</>, fi). As explained in our description 
of the strategy of proof, solving (|4]) with this (ip, fi) as input, and denoting by (X t , Y t , Z t )o<t<T the 
resulting solution, by definition of a fixed point, we have Y t = ip(t, X t ) for any t G [0, T], a.s., and 
Fx = n so that Fx t = Ht- in particular, ip(t, •) o fit coincides with Frx t ,YA- We conclude that 
(X u Y t , Z t ) <t<T satisfies Q. □ 

2.5. Relaxing the Boundedness Condition. We now complete the proof of Theorem [T] when the 
coefficients only satisfy (A 1-3). The proof consists in approximating B and F by sequences of 
bounded coefficients (B n ) n >\ and (F n ) n >\. 

Proof. (Theorem 1) For any n > 1, t G [0, T], x G M d , y G W, z G M pxm and /i G T> 2 (^ d x M m ), 
we set: 

5 n (t, x, y, /x) = IlW (S(t, x, y, z, /x)) , F n (t, x, y, z, M ) = 4 P > (F(t, x, y, z, M )) , 

where, for any integer /c, LT^-* is the orthogonal projection from M fc onto the /c-dimensional ball of 
center and of radius n. 

For each n, assumptions (Al-3) are satisfied with B" and F n instead of B and F, and we denote 
by (X n , Y n , Z n ) the solution of ([T]) given by Proposition[T]when the system ([T]) is driven by B n , F n , 
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S and G. As explained in the previous subsection, the process Y n satisfies Y t n = u n (t, X™), for any 
t G [0, T], for some deterministic function u n . The first step of the proof is to establish the relative 
compactness of the families (ii n ) n >i and (Px n )n>l- 

We notice first that the processes (Y n ) n >i are uniformly bounded by a constant that depends upon 
L only. Indeed, applying Ito's formula and using the specific growth condition (A2), we get: 

r T 

Vie[0,T], E[\Y t n \ 2 ] < C + CJ E[\Y s n \ 2 ]ds, 

for some constant C depending on T and L only. As usual, the value of C may vary from line to line. 
By Gronwall's lemma, we deduce that the quantity sup 0<t<T E[| Y t n | 2 ] can be bounded in terms of T 
and L only. Injecting this estimate into (A2) shows that the driver (— F n (t, X™, Y t n , Z™, IP(x t n ,Y™)))o<t<r 
is bounded by (C(l + |Y 4 n |))o<t<T> for a possibly new value of C. Using now the uniform bounded- 
ness of the function G giving the terminal condition Yt, we conclude that the processes (l^ n )o<t<T 
are bounded, uniformly in n > 1. Indeed, for any n > 1, the process (\Y t n \ 2 )o< t <T can be seen as a 
solution of a BSDE with random coefficients; thus, it can be compared with the deterministic solution 
of a deterministic BSDE with a constant terminal condition. In particular, there exists a positive con- 
stant, say 7, such that, for any n > 1 for any t G [0, T], \u n (t, X^)\ < 7. Since for each t G [0, T], 
X™ has a density with respect to Lebesgue's measure on R d (recall that B n is bounded and SS^ is 
bounded from above and from below away from 0), we deduce that u n is bounded by 7 on the whole 
[0, T] x R d , and for any n > 1. As a by-product, we get 

rT \ 2-| 

(10) E 



\Z n A 2 ds 







< C. 



When p = 1, Theorem 2.1 in [8] says that the value function 
Sobolev derivatives of order 1 in time and of order 2 in space of a quasilinear PDE. As mentioned 
in the conclusion of (H, the result remains true when p > 2, the value function u n then solving a 
system of quasilinear PDEs. Using the fact that the proofs of Theorems 1.1 and 1.3 of Q are local 
(even though they are stated under global assumptions), we deduce that the functions (n n ) n >i are 
continuous in (t, x), uniformly in n > 1, on any compact subsets of [0, T] x W 1 . In a similar way, by 
Theorem 2.7 of [7], the proof of which is also local, the functions ((u n (t, ■)) n >i)o<t<T are locally 
Lipschitz-continuous, uniformly in t G [0, T] and in n > 1. 

Applying Ito's formula to |X n | 2 , using the growth conditions (A2), the uniform boundedness of 



Y t n , the boundedness of £ and the bound ([10]), we can use Gronwall's lemma and get the existence of 
a finite constant C such that, for any n > 1, 

(11) E[ sup \X?\ 4 ] < C. 

0<t<T 

Using the bound (A2) for B n and the same constant L, together with the uniform boundedness of the 



paths Y t n , ( pLOp and ( |TT| ), it is easy to check that 

E[|Xf-X s n | 4 ] <C\t-s\ 2 

for all s,t £ [0, T] for a constant C independent of n, s and t. Consequently, Kolmogorov's criterion 
shows that the family (fx n )n>i of probability measures on C([0, T] ; R d ) is tight. Replacing Y t n by 
u n (t, X™) in £(£, X", Y t n , P(x™,y n ))» we see ma t each component of u n satisfies a PDE, and since 
x, u n (t, x), IP(x' l ,y n )) is locally Holder continuous uniformly in n > 1, we can use Theorem 4, 
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Chapter 7 Section 2 of and conclude that the gradients of these components are locally Holder 
continuous in (t, x) on compact subsets of [0, T) x uniformly in n > 1. Consequently, one can 
thus extract a subsequence (nk)k>i sucn that u' lk and d x ii nk converge uniformly on compact subsets 
of [0, T] x M. d and of [0, T) x M. d respectively, and Fx n k converges towards a probability measure 
H on C([0, T];M. d ). If we denote by u the limit of u nk , the function u is continuously differentiable 
with respect to x on [0, T) x W 1 , and d x u nk converges towards d x u uniformly on compact subsets 
of [0, T) x M. d . As before, we can use the uniform boundedness of the functions u n , the tightness of 
the sequence {^'x n k)k>i and the uniform convergence of u nu towards u on any compact subset of 
[0, T) x R d to conclude that for any t E [0, T], 

(12) lim W 2 (u nk (t,-)oF x n k ,u{t,-)ofi t ) = 0, 

p— >+oo x t ' 

where /x t stands for the marginal law of [i of time index t. Now for each R > 0, we denote by tJj 
the first time the process JC 1 leaves the ball of center and radius R, in other words = inf {t > 
0; \Xf-\ > R}. Using the uniform convergence of (u nk , d x u nk ) to (u,d x u) on compact subsets of 



[0, T) x R d together with ([TO]), we have for each fixed R > 0: 



lim sup E[ sup \Xt k+l -Xt k+e '\ 2 ] =0. 



fc->+oo^,> < t < T »*-W AT ~*-W' 



Since ( [TT] ) implies that: 



lim sup E[ sup |X t nfe+< '-X" fe ^| 2 l 



R->+oo l t £i>i 0<t<T \ i; "r 

we must have: 

lim sup E[ sup |Xf fc+£ - X t nfc+f T] = 0, 
fc-H-°°£,£'>0 0<t<T 

which shows that the sequence (X n *)fc>i is a Cauchy sequence. We denote by X_ the limit. Since for 
each n > 1 we have: 

for < t < T, with t; n (t,x) = d x u n (t, x)T,(t, x, u n (t, x), u n (t, •) oP I( »), we can take the limit 
along the subsequence (nk)k>i and since /i is the law of X_, the local uniform convergence of u nk 
and d x u nk towards u and d x u, together with ([12]) imply that 



dX t = B(t,X t ,u(t, X t ),v(t,X t ), u(t,-) o¥ Xt )dt + E(t,X t ,u(t,X t ),u(t,-)oF Xt )dWt 

for < t < T, with v(t, x) = d x u(t, x)T,(t, x, u(t, x), u(t, ■) o Px t ). which is exactly the forward 
component of the McKean-Vlasov FBSDE ([T]) provided we set Y t = u(t, X t ) and Z t = v(t, X t ) for 
t G [0,T). It is plain to deduce that the sequences (Y nk )h > i and (Z_ Uk )k>i are Cauchy sequences for 
the norms E[sup < s <r I -s | 2 ] 1//2 and E[J Q T | - s | 2 cZs] 1//2 respectively. Denoting the respective limits by 
Y and Z, it holdsriP~a.s., for any t G [0, T], Y t = u(t, X t ), and, ¥ ® dt a.e., Z t = v(t, X t ). Passing 
to the limit in ([T]) with (B,F) therein replaced by (B Uk , F nk ), we deduce that (X, Y, Z) satisfies 

0- □ 
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2.6. Counter- Example to Uniqueness. We close this section with a counter-example showing that 
uniqueness cannot hold in general under assumptions (Al-3), even in the case d = m = p = 1. 
Indeed, let us consider the forward-backward system 

dX t = B{E{Y t ))dt + dW t , X = x , 

dY t = -F(E(X t ))dt + Z t dW u Yt = G(E(X T )), 

where B, F and G are real valued bounded and Lipschitz-continuous functions on the real line. Let 
us also assume that they coincide with the identity on [-R, R] for R large enough. In other words, 
we assume that B{x) = F{x) = G(x) = x for \x\ < R. For T = 7r/4 and for any iel, the pair 

xt = ^4sin(t), yt = Acos(t), < t < T = — , 

satisfies 

xt = yt, yt = -x t , < t < T, 
yr = x Tl 

with xq = as initial condition. Therefore, for \ A\ < R, (As'm(t), Acos(t))o<t<T is a solution to 
the deterministic forward-backward system 

x t = B(y t ), y t = -F{x t ), 0<t<T, 

y T = G(X T ), 

with xq = as initial condition. For such a value of A, set now: 

X t = x t + Wt, Y t = y t , 0<t<T. 

Then, (X, Y, 0) solves 

dX t = B(E(Y t ))dt + dW t 
dY t = -F(E(X t ))dt + 0dW t , 
with Xq = and Yp = G(E(Xt)), so that uniqueness fails. 

Remark 2. The reason for the failure of uniqueness can be explained as follows. In the standard 
framework, as explained in @, uniqueness holds because of the smoothing effect of the diffusion 
operator in the spatial direction. However, in the McKean-Vlasov setting, the smoothing effect of the 
diffusion operator is ineffective in the direction of the measure variable. 

3. Applications 

We now describe the set-up of the two applications mentioned in the introduction. 

We define the set A of admissible controls as the set of progressively measurable processes a = 
(at)o<t<T with values in a measurable space (A, A). Typically, A is a Borel subset of a Euclidean 
space M. k , and A the cr-field induced by the Borel cr-field of this Euclidean space. For each admissible 
control process a we consider the solution X = (Xt)o<t<T of the (nonlinear) stochastic differential 
equation of McKean-Vlasov type 

(14) dX t = b(t,X t ,Fx t ,at)dt + a(t,X t ,F Xt ,a t )dWt 0<t<T, X = x , 

where the drift b and the volatility a are deterministic measurable functions b : [0, T] x W 1 x V2(^- d ) x 
A -> R d and a : [0, TjxR^x P 2 (K d ) X A ->■ R dxm satisfying regularity conditions to be specified 
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below. The stochastic optimization problem associated to the optimal control of McKean-Vlasov 
stochastic dynamics is to minimize the objective function 

(15) J(o)=e(J f(t,X t ,F Xt ,a t )dt + g(X T ,F XT )\, 

over a set a G A where the running cost function / is a real valued deterministic measurable function 
defined on [0, T] x R d x 

V2{^ d ) x A and the terminal cost function g is an M d -valued deterministic 
measurable function defined on R d x V2(R d )- Though different in its goal, the mean field problem 
as stated by Lasry and Lions is similar in many ways. The main difference comes from the fact that, 
because the ultimate goal is to identify approximate Nash equilibriums for large games, the stochastic 
optimization problem is to minimize the objective function 

(16) J(a)=mU f(t,X t ,iH,at)dt + g(X T ,iMr)\, 

over the same set A of admissible control processes when the measure argument in the dynamics of the 
state, the running cost function / and the terminal cost function g are fixed - one often say frozen to 
emphasize this difference - and given by a deterministic flow fi = (yt)o<t<T of probability measures 
on R d . The similarity comes from the fact that, once the optimization for y fixed is accomplished, 
this flow of measures is determined in such a way that the marginal distribution of the solution of the 
stochastic differential equation 

(17) dX t = b(t,X t , t i t ,a t )dt + o-(t,X t ,m,a t )dW t 0<t<T, X = x , 

is fi t , in other words in such a way that fi t = Px t f° r all < i < T, forcing the optimally controlled 
state process X to actually solve the McKean - Vlasov equation ( [T4] ). If an appropriate stochastic 
version of the Pontryagin maximum can be proven, the probabilistic approach to these stochastic 
optimization problems is to introduce adjoint processes ^Y_,Z) = (Y t , Z t )o<t<T solving the adjoint 
BSDE, and to couple these backward equations with the forward equations ( fT7| ) and ( [14] ) by intro- 
ducing a control a minimizing a specific Hamiltonian function. This coupling of the forward and 
backward equations creates a FBSDE of the McKean-Vlasov type in which the marginal distributions 
of the solutions appear in the coefficients, and which needs to be solved in order to find a solution of 
the stochastic optimization problems. This was the motivation behind the statement of the problem 
considered in Section [2] 

For the sake of completeness we assume that, in both applications, the drift coefficient b and the 
volatility matrix a satisfy the following assumptions. 

(51) The functions (b(t, 0, 5$, 0))o<t<T and (cr(t, 0, do, 0))o<t<T are continuous; 

(52) 3c > 0, Vt G [0, T], Va G A, Vx, x' G R d , V//, yl G V 2 {R d ), 

\b(t, x, fi, a) — b(t, x , fj, , a)\ + \cr(t, x, fi, a) — a(t, x , //, a)\ < c(\x — x'\ + W2(fJ>, (J?)) 

We further restrict the set A to the set of progressively measurable processes a in M 2,k . Together with 
the Lipschitz assumption (S2), this definition guarantees that, for any a G A, there exists a unique 
solution 2L = XJ^ of both ( fT7| ) and ( fR] ) and that moreover, this solution satisfies 

E[ sup \X t \ q \ < oo 

0<t<T 

for every q > 1. See for example |[T6l[T0l for a proof for the McKean-Vlasov equations. 
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3.1. Solvability of FBSDE for Mean-Field Games. In this subsection, we apply the abstract exis- 
tence result of Section|2]to the Mean Field Game (MFG) problem described in the introduction. As in 
H, we assume that the volatility function is a constant matrix a (of size d x m), but here we assume 
in addition that det(<JC7^) > 0. Since the stochastic optimization problem is solved after the flow of 
measures is frozen, for each fixed fi, the Hamiltonian of the system reads: 

(18) H^ l (t,x,y,a) = b(t,x,n,a) -y + f(t,x,fi,a), x, y £ R d , a £ A, fj, £ V 2 {R k ), 

'•' standing for the inner product. For each \i, we assume the existence of a function (t,x,y) 

x, y) G A which is Lipschitz-continuous with respect to (x, y, fi), uniformly in t £ [0, T], the 
Lipschitz property holding true with respect to to the product distance of the Euclidean distances on 
R d x R d and the 2-Wasserstein distance on T^CR* 2 ) and such that: 

(19) a»(t,x,y) G argmin QeRd ^(t, x, y, a), t £ [0,T], x,y £ R d , £ V 2 (R d ). 

The existence of such a function was proven in [4 ] under specific assumptions on the drift b and the 
running cost function / used there. Using a standard version of the stochastic maximum principle, 
and coupling the forward dynamics and the adjoint BSDE by plugging the optimizer ( p~9] > lead to the 
solution, for each frozen flow fi = {nt)o<t<T of measures, of a standard FBSDE. Now, if we add 
the requirement that m should coincide for each t with the marginal distribution of the optimally 
controlled state, the solution of the MFG stochastic optimization problem reduces to the solution of 
the FBSDE of McKean - Vlasov type 

(20) 

dX t = b(t, X t ,F Xt ,a(t, x t, Y t , F Xt ))dt + adW t , 

dY t = -d x f(t,X t ,F Xt ,a(t,X t ,Y t ,F Xt ))dt-d x b(t,X u F Xt ,a(t,X t ,Y t ,F Xt )) -Y t dt + Z t dW t , 

with initial condition Xq = xq for a given deterministic point xq £ R d , and terminal condition 
Yt = d x g{XT, F Xt ). Existence of a solution for this McKean- Vlasov FBSDE follows by applying 
Theorem [T] if we assume that d x f and d x g are bounded and Lipschitz-continuous and set: 

B(t, X t , Y t , Z t ,F {XuYt) ) = b(t, X t ,F Xt ,at), 

F(t, X t , Y t , Z t ,F {XuYt) ) = d x f{t, X t ,F Xt ,a(t, X t , Y t ,F Xt )) 

+ 8 x b{t, X t , F Xt , a(t, X t , Y t , F Xt )) • Y t , 

G{X T ,F XT ) = d x g{X T ,F XT ). 

While the result of the present note provides existence in quite a general set up, [4] also allows for 
running costs with at most linear growth in x (and provides much more in terms of the identification 
of approximate Nash equilibriums). However, the drift b needs to be of a very specific affine form, 
namely b(t, x, y, a, //) = bo(t, n) + b\(t)x + 62(^)0 for some deterministic functions bo, b\ and b 2 , 
and the running cost function / has to satisfy a strong convexity assumption, the latter allowing a to 
be degenerate, providing a more efficient approximation procedure to reduce the result to the bounded 
case and ensuring the validity of the converse of the stochastic maximum principle. 

3.2. Optimal Control of McKean- Vlasov Stochastic Dynamics. Finally, we explain how the exis- 
tence result of this paper generalizes the existence result of [3 ] where a solution of the optimal control 
of stochastic differential equations of the McKean- Vlasov type is given. 
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As before, we assume that the volatility function is a constant matrix a such that det(aa Jf ) > 0, so 
the Hamiltonian has the form 

H(t, x, y, n, a) = b(t, x,/j,,a)-y + f(t, x, n, a), 

for t G [0, T], x,y G R d , a G A and G T^O^). Again, we assume the existence of a function 
(t, x, y, /i) a(t, x, y, fi) G A which is Lipschitz-continuous with respect to (x, y, fi), uniformly in 
t G [0, T], the Lipschitz property holding true with respect to the product distance of the Euclidean 
distances on R d x R d and the 2-Wasserstein distance on V2(R d ) and such that: 

(21) &(t,x,y,fi) G argmin amd H(t,x,y,n,a), t G [0, T], x, y G R d , /x G P 2 (M d ). 

The existence of such a function was proven in [3] under specific assumptions on the drift b and the 
running cost function / used there. However, the major difference with the mean field game problem 
comes from the form of the adjoint equation which now involves differentiation of the Hamiltonian 
with respect to the measure parameter. A special form of adjoint equation was introduced, and a new 
stochastic maximum principle was proven in |3]. Once this new form of adjoint equation is coupled 
with the forward dynamical equation through the plugged-in optimal control feedback a defined in 



( |2T| ), the associated McKean-Vlasov FBSDE takes the form 

dX t = b(t,X t ,F Xt ,a(t,X t ,Y t ,F Xt ))dt + adW t , 

dY t = -d x f{t,X t ,F Xt ,a(t,X t ,Y t ,F Xt ))dt 

(22) -d x b(t,X t ,F Xt ,a(t,X t ,Y t ,F Xt )) -Y t dt + Z t dW t 

-t%f[t, X t ,F Xt ,a(t, X t , Y t , F Xt )) (X t )] dt 

- E [d^b(t, X t , F Xt , a(t, X t , Y t ,F Xt )) (X t ) ■ Y t ] dt, 

with initial condition Xq = xo for a given deterministic point xo G R d , and terminal condition 
Yt = d x g(XT,F Xr ) + K[d fl g(XT,F XT )(XT)]- Above, the tilde refers to an independent copy 
of (X t , Y t , Z t ). The notations d^b(t, x, Pg, a)(£) and d^f(t, x, F^, a)(£), for an M d -valued random 
variable £ of order 2, represent the derivatives of b and / with respect to the measure argument at 
(t, x, Pf , a) according to the rule of differentiation introduced by P.L. Lions as explained in 0: the 
derivatives are represented by random variables, obtained by plugging the current value of £ into some 
functions x, F^, a)( • ) and d^f(t, x, Pg, «)(•)• We chose the order in which the various terms 

appear in the right hand side of the backward equation in order to emphasize the last two terms which 
are not present in standard forms of adjoint BSDEs. Because of these two last terms, the coefficients 
depend on the joint law of (X t , Y t ), which is not the case in p0] >. Despite this non-standard form of 
the FBSDE, the existence result of this note applies (under appropriate conditions on the coefficients). 
Under very restrictive conditions on the drift b and a strict convexity assumption on /, [3] provides 
uniqueness, allows for at most linear growth in the variable x and guarantees the validity of the 
converse of the maximum principle. 
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